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cells in the organism. As noted be,ow, however, ,he expression of the genes 
makmg up the genomic DNA may vary between individual cells. 

cDNA and cDNA Libraries 

Within a given cell, tissue or organism, there exist myriad mRNA species 
each encoding a separate and specific protein. This fact provides a powerft.1 tooi 
«o mvestigators interested in studying genetic expression in a tissue or cell - 
mRNA molecules may be isolated and further mamputated by various molecular 
hologtca. techniques, thereby allowing the elucidation of the fu,l fictional 
genetic content of a cell, tissue or organism. 

One common approach to the study of gene expression is the production 
of complementary DNA (cDNA) clones. ,,,„is technique, the mRNA molecules 
from an organism are isolated from an extract of the cells or tissues of the 
orgasm. This isolation often employs so,id chromatography matrices, such as 
celhnose or hydroxyapatite, towhich oligomers of deoxythymidine (dT) have been 
complexed. Since the V termini on al, eukaryotic mRNA molecules contain a 
stnng of deoxyadenosine (dA) bases, and since dA binds to dT, the mRNA 

mceculescanberapidlypurifiedfromothermoleculesandsubs^ncesinaetissue 
or cell extract. From these purified mRNA molecules, cDNA copies may be made 

usmg the enzyme reverse transcriptase, which resuUs in the production of singte- 
stranded cDNA molecules. The sin^e-stranded cDNAs may then be converted 
mto a complete double-stranded DNA copy of the origmal mRNA (and thus of the 
ongmal double-stranded DNA sequence, encoding this mRNA, contained in the 
genome of the organism) by the action of a DNA polymerase. The protein- 
specfic double-stranded cDNAs can then be inserted into a p,asmid, which is then 
mtroduced into a host bacteria, cell. The bacteria, cells are men grown in cuhure 
med,a, resulting i„ . p „ puIation of bacterjal ^ ^ ^ 

expressing) the gene of interest. 

This entire process, from isolation of mRNA to insertion of the cDNA into 
a Plasmid to growth of bacteria, populations containing , ne , solated „ 
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termed M cDNA cloning." If cDNAs are prepared from a number of different 
mRNAs, the resulting set of cDNAs is called a "cDN A library," representing the 
different functional (i.e., expressed) genes present in the source cell, tissue or 
organism. Genotypic analysis of these cDNA libraries can yield much information 
on the structure and function of the organisms from which they were derived. 

DNA Amplification 

In order to increase the copy number of, or "amplify," specific sequences 
of DNA in a sample, investigators have relied on a number of amplification 
techniques. A commonly used amplification technique is the Polymerase Chain 
Reaction ("PCR") method described by Mullis and colleagues (U.S. Pat. Nos. 
4,683,195; 4,683,202; and 4,800,159). This method uses "primer" sequences 
which are complementary to opposing regions on the DNA sequence to be 
amplified. These primers are added to the DNA target sample, along with a molar 
excess of nucleotide bases and a DNA polymerase (e.g., Taq polymerase), and the 
primers bind to their target via base-specific binding interactions (i.e., adenine 
binds to thymine, cytosine to guanine). By repeatedly passing the reaction mixture 
through cycles of increasing and decreasing temperatures (to allow dissociation 
of the two DNA strands on the target sequence, synthesis of complementary 
copies of each strand by the polymerase, and re-annealing of the new 
complementary strands), the copy number of a particular sequence of DNA may 
be rapidly increased. 

Other techniques for amplification of target nucleic acid sequences have 
also been developed. For example, Walker et al. (U.S. Pat. No. 5,455,166; 
EP 0 684 315) described a method called Strand Displacement Amplification 
(SDA), which differs from PCR in that it operates at a single temperature and uses 
a polymerase/endonuclease combination of enzymes to generate single- stranded 
fragments of the target DNA sequence, which then serve as templates for the 
production of complementary DNA (cDNA) strands. An alternative amplification 
procedure, termed Nucleic Acid Sequence-Based Amplification (NASBA) was 



-5- 



disclosed by Davey et al. (U.S. Pat. No. 5,409,818; EP 0 329 822). Similar to 
SDA, NASBA employs an isothermal reaction, but is based on the use of RNA 
primers for amplification rather than DNA primers as in PCR or SDA. Another 
known amplification procedure includes Promoter Ligation Activated 
Transcriptase (LAT) described by Berninger et al. (U. S. Patent No. 5, 1 94,370). 

PCR-based DNA Fingerprinting 

Despite the availability of a variety of amplification techniques, most DNA 
fingerprinting methods rely on PCR for amplification, taking advantage of the 
well-characterized protocols and automation available for this technique. 
Examples of these PCR-based fingerprinting techniques include Random 
Amplified Polymorphic DNA (RAPD) analysis (Williams, J.G.K. et al., Nucl. 
Acids Res. J8(22):653l-6535 (1990)), Arbitrarily Primed PCR (AP-PCR; Welsh, 
J., and McClelland, M., Nucl. Acids Res. 18(24):12\3-12\% (1990)), DNA 
Amplification Fingerprinting (DAF; Caetano-Anolles et al, Bio/Technology 
P.553-557 (1991)), and microsatellite PCR or Directed Amplification of 
Minisatellite-region DNA (DAMD; Heath, D.D. et al, Nucl Acids Res. 
2/f2¥):5782-5785 (1993)). All of these methods are based on the amplification 
of random DNA fragments by PCR, using arbitrarily chosen primers. 

DNA Sequencing 

In general, two techniques have been traditionally used to sequence nucleic 
acids. In the first method, termed "Maxam and Gilbert sequencing" after its co- 
developers (Maxam, A.M. and Gilbert, W., Proc. Natl. Acad. Sci. USA 74:560- 
564, 1977), DNA is radiolabeled, divided into four samples and treated with 
chemicals that selectively destroy specific nucleotides bases in the DNA and cleave 
the molecule at the sites of damage. By separating the resultant fragments into 
discrete bands by gel electrophoresis and exposing the gel to X-ray film, the 
sequence of the original DNA molecule can be read from the film. This technique 
has been used to determine the sequences of certain complex DNA molecules, 
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including the primate virus SV40 (Fiers, W., et al, Nature 273: 1 13-120, 1978; 
Reddy, V.B., et al, Science 200:494-502, 1978) and the bacterial plasmid 
PBR322 (Sutcliffe, G., Cold Spring Harbor Symp. Quant. Biol 43:444-448 
1975). 

An alternative technique for sequencing, named "Sanger sequencing" after 
its developer (Sanger, F., and Coulson, A.R., J. Mol Biol. 94:444,448, 1975), i s 
more commonly employed. This method uses the DNA-synthesizing activity of 
DNA polymerases which, when combined with mixtures of reaction-terminating 
dideoxynucleoside triphosphates (Sanger, F., et al, Proc. Natl Acad. Sci. USA 
74:5463-5467, 1977) and a short primer (either of which may be detectably 
labeled), gives rise to a series of newly synthesized DNA fragments specifically 
terminated at one of the four dideoxy bases. These fragments are then resolved 
by gel electrophoresis and the sequence determined as described for Maxam and 
Gilbert sequencing above. By carrying out four separate reactions (once with 
each ddNTP), the sequences of even fairly complex DNA molecules may rapidly 
be determined (Sanger, F., et al, Nature 265:678-695, 1977; Barnes, W., Meth. 
Enzymol 152:538-556, 1987). While Sanger sequencing usually employs K coli 
or T7 DNA polymerase (U. S. Patent No. 4,795,699), recent modifications of this 
technique using T7 polymerase mutants allow sequencing to be accomplished 
using a single sequencing reaction containing all four chain-terminating ddNTPs 
at different concentrations (U.S. Patent Nos. 4,962,020 and 5, 173,41 1). Further 
modifications to the technique, to reduce or eliminate the buildup of reaction- 
poisoning pyrophosphate in the reaction mixtures, have also been described (U.S. 
Patent No. 5,498,523). Other variations for sequencing nucleic acid molecules 
have also been described (see Murray, Nucl Acids. Res. 17:8889, 1989; and 
Craxton, Methods: A Comparison to Methods in Enzymology, 3:20-25, 1991). 

Limitations 

As noted above, the faithful and high-fidelity copying of a template nucleic 
acid molecule is an essential step in the synthesis of a nucleic acid molecule in 



amplification, reverse transcription, and sequencing protocols. However, the use 
of standard compositions and protocols to accomplish this synthesis is often 
inefficient, in that they tend to terminate nucleic acid synthesis prematurely at 
certain secondary structural (Gerard, G.F., et al, FOCUS 7/(4):60 (1989); 
Myers, T.W., and Gelfand, D.H., Biochemistry 30:7661 (1991)) and sequence 
(Messer, L.I., etal., Virol. 146:146 (1985)); Abbotts, J., et al, J. Biol. Chem. 
268: 103 12-10323 (1993)) barriers in nucleic acid templates. This is particularly 
true for template sequences that have high guanine/cytosine content (i.e., "GC- 
rich" templates) and those that are fairly large in size (i.e., templates that are larger 
than about 3-5 kb in length). These secondary structural and sequence barriers in 
the template nucleic acid molecules occur frequently at homopolymer stretches 
(Messer, L.I., etal, Virol. 146:146 (1985); Huber, HE., etal., J. Biol. Chem. 
264:4669-4678 (1989); Myers, T.W., and Gelfand, D.H., Biochemistry 30:7661 
(1991)) and are more often sequence rather than secondary structural barriers 
(Abbotts, J., et ai, J. Biol. Chem. 268:10312-10323 (1993)). If these barriers 
could be overcome, yield of total and full-length nucleic acid products in synthesis 
reactions could be increased. 

Some reports have indicated that modulation of the ionic strength or 
osmolality of the reaction mixtures, particularly of the concentration of Na + and 
K + ions, may influence the secondary structure and condensation of nucleic acids 
in vitro much as they do in vivo (Le Rudulier, D., et al. , Science 224: 1 064 ( 1 984); 
Buche, A., et al., J. Biomolec. Struct. Dyn. #(3):601 (1990); Marquet, R., and 
Houssier, C , J. Biomolec. Struct. Dyn. P(l):159 (1991); Buche, A., et al, J. 
Biomolec. Struct. Dyn. 77(1):95 (1993); Woodford, K., et al, Nucl. Acids Res. 
25(3):539 (1995); Flock, S., et al, Biophys. J. 70: 1456 (1996); Flock, S., et al, 
Biophys. J. 71:1519(1996); EP0 821 059 A2). In some of these studies, in vitro 
nucleic acid conformation and stability was found to be improved in buffer 
solutions containing any of a number of natural and synthetic osmoprotectant 
compounds, including polysaccharides such as trehalose (Carninci, P., etal, 
Proc. Natl. Acad. Sci. USA 95:520-524 (1998)), certain co-solvents such as 
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glycerol and dimethylsulfoxide (Varadaraj, K., and Skinner, DM., Gene 140:1 
(1994)); glycine and derivatives thereof (Buche, A., et al. , FEES Lett. 247(2)367 
(1989); Flock, S., etal, J. Biomolec. Struct Dyn. 73(1):87 (1995); Houssier, C, 
et al, Comp. Biochem. Physiol 77Z4(3):313 (1997)); low molecular weight 
amines such as beta-alanine, asparagine and cystamine (Kondakova, N.V., et al, 
Mol Biol (Moscow) P(5):742 (1975); Aslanian, V.M., etal,Biofizika29(4):564 
(1984)); and other nitrogen-containing compounds and amino acids such as 
proline, betaine and ectoine (Rees, W. A., etal, Biochemistry 32: 137-144 (1993); 
WO 95/20682; DE 44 11 588 CI; DE 44 11 594 CI; Mytelka, D.S., etal,Nucl. 
AcidsRes. 2<14):2774 (1996); Baskaran, N., etal, Genome Res, 6:633 (1996); 
Weissensteiner, T., and Lanchbury, J.S., BioTechniques 27(6): 1102 (1996); 
Rajendrakumar, C.S.V., et al, FEBS Letts. 410:201-205 (1997); Henke, W., et 
al, Nucl AcidsRes. 2J(19):3957 (1997); Hengen, P.N., TIBS 22:225 (1997)). 
Betaine and ectoine are natural osmoprotectants in a variety of bacterial and 
animal cells (Chambers, ST., etal, J. Bacteriol 75P(10):4845 (1987); Randall, 
K., et al, Biochim. Biophys. Acta 7297(3):189 (1996); Randall, K., et al, 
Biochem. Cell Biol 74(2):2S3 (1996); Malin, G., and Lapidot, A., J. Bacteriol 
775(2):385 (1996); Gouesbet, G., et al, J. Bacteriol 77S(2):447 (1996); 
Canovas, D., etal, J. Bacteriol 775(24):7221 (1996); Canovas, D , etal, J. 
Biol Chem. 272(41):25794-25801 (1997). 

There remains a need in the art, however, for compounds, compositions 
and methods that are useful in enhancing synthesis of nucleic acid molecules, 
particularly those that are GC-rich and/or those that are relatively large. 

BRIEF SUMMARY OF THE INVENTION 

The present invention relates generally to compounds, compositions and 
methods useful in enhancing synthesis of nucleic acid molecules, especially from 
GC-rich nucleic acid templates. In one aspect, the invention relates to compounds 
and compositions for use in synthesizing a nucleic acid molecule, particularly for 
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template mediated synthesis such as in amplification, reverse transcription, and 
sequencing reactions. The compounds and compositions of the invention 
comprise one or more compounds having a chemical formula selected from the 
group consisting of formula I and formula II, and salts and derivatives thereof In 
a preferred aspect, the compounds used in the invention include any amino acid, 
any saccharide (monosaccharide or polysaccharide), any polyalcohol, or salts or 
derivatives thereof The compounds or compositions of the invention include 
compounds having the chemical formula as set forth in formula I or formula II, or 
salts or derivatives thereof, wherein the aryl group is selected from the group 
consisting of phenyl, naphthyl, phenanthryl, anthracyl, indenyl, azulenyl, biphenyl, 
biphenylenyl and fluorenyl groups; wherein the halo group is selected from the 
group consisting of fluorine, chlorine, bromine and iodine; wherein the alkyl group 
is selected from the group consisting of methyl, ethyl, propyl, isopropyl, butyl, 
pentyl, hexyl, heptyl, octyl, nonyl, and decyl, and may be a branched chain alkyl 
group; wherein the alkenyl group is selected from the group consisting of ethenyl, 
propenyl, butenyl, pentenyl, hexenyl, heptenyl, octenyl, nonenyl and decenyl, and 
may be a branched chain alkenyl group; wherein the alkynyl group is selected from 
the group consisting of ethynyl, propynyl, butynyl, pentynyl, hexynyl, heptynyl, 
octynyl, nonynyl and decynyl, and may be a branched chain alkynyl group; and 
wherein the lower alkoxy (ether) group is oxygen substituted by one of the alkyl 
groups mentioned above. The invention also relates to salts and derivatives of 
such compounds. In particularly preferred aspect of the invention, the compounds 
are selected from the group consisting of 4-methylmorpholine N-oxide, betaine, 
carnitine, ectoine, proline, glycine, pipecolic acid, trimethylamine N-oxide, N- 
alkylimidazole compounds such as 1 -methylimidazole or 4-methylimidazole, 
poly(2-ethyl-2-oxazoline) of average molecular weight about 50,000 to about 
500,000 daltons, poly(diallyldimethylammonium chloride) of average molecular 
weight about 100,000 to about 200,000 daltons, or salts or derivatives thereof 
The invention also relates to compositions which comprise the compounds of the 
invention and one or more additional components selected from the group 
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cons,sti„g of (i) one or more enzymes having nucleic acid polymerase activity 
winch may be thermostable enzymes, (ii) one or more nucleotides, (iii) one or 
more buffering salts, and (iv) one or more nucleic acid molecuies. Preferred such 
enzymes according to .his aspect of the invention may include a DNA polymerase 
(such as Tag, Tne, Tma, PJu, VENT™, DEEP VENT™ and m DNA 
polymerases, and mutants, variants and derivatives thereof), an RNA polymerase 
(such as SP6, T7 or T3 RNA polymerase and mutants, variants and derivatives 
thereof) and a reverse transcriptase (such as M-MLV reverse transcriptase RSV 
reverse transcriptase, AMV reverse transcriptase, RAV reverse transcriptase 
MAV reverse transcnptase and HIV reverse transcriptase and mutants, variants 
and denvatives thereof). Preferably such reverse transcnptases are reduced or 
substantially reduced in RNase H activity. 

The invention also relates to methods for synthesizing . „ uc l e i c acid 
molecule, compnsing (a) mixing a nucleic acid template (which may be a DNA 
molecule such as a cDNA molecule, or an RNA molecule such as a mRNA 
molecule) with one or more (preferably two or more, three or more, four or more 
Ave or more etc.) of the compounds or compositions of the invention to form a 
mtxture; and (b) incubating the mixture under conditions sufficient to make a firs, 
nucle,c acid molecule complement ,o all or a portion „f th e template Such 
methods of the invention may optionally comprise one or more additional steps 
such as incubating the above-described firs, nucleic acid molecule under 
cond.ttons sufficient ,„ make a second nucleic acid molecule complement to al, 
or a portion of the first nucleic acid molecule. The invention also relates to 
nucleic acid molecules made by these methods, to vectors (which may be 
express™ vectors) comprising these nucleic acid molecules, and to hos, cells 
compnsmg these nucleic acid molecules or vectors. The invention also relates to 
methods of producing a polypeptide, comprising entering the above-described 
host cells under conditions favoring the production of ,he polypeptide by the hos, 
cells, and tsolating the polypeptide. The invention also relates to polypeptides 
produced by such methods. 
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The invention also relates to methods for amplifying a nucleic acid 
molecule comprising (a) mixing a nucleic acid template with one or more of the 
compounds or compositions of the invention to form a mixture; and (b) incubating 
the mixture under conditions sufficient to amplify a nucleic acid molecule 
complementary to all or a portion of the template. More specifically, the invention 
relates to a method of amplifying a DNA molecule comprising: 

(a) providing a first and second primer, wherein said first primer is 
complementary to a sequence at or near the 3 '-termini of the first 
strand of said DNA molecule and said second primer is 
complementary to a sequence at or near the 3 '-termini of the second 
strand of said DNA molecule; 

(b) hybridizing said first primer to said first strand and said second primer 
to said second strand in the presence of one or more compounds or 
compositions of the invention, under conditions such that a third DNA 
molecule complementary to said first strand and a fourth DNA 
molecule complementary to said second strand are synthesized; 

(c) denaturing said first and third strand, and said second and fourth 
strands; and 

(d) repeating steps (a) to (c) one or more times. 

Such conditions may include incubation in the presence of one or more 
polymerases, one or more nucleotides and/or one or more buffering salts. The 
invention also relates to nucleic acid molecules amplified by these methods. 

The invention also relates to methods for sequencing a nucleic acid 
molecule comprising (a) mixing a nucleic acid molecule to be sequenced with one 
or more primers, one or more of the compounds or compositions of the invention, 
one or more nucleotides and one or more terminating agents to form a mixture,' 
(b) incubating the mixture under conditions sufficient to synthesize a population 
of molecules complementary to all or a portion of the molecule to be sequenced; 
and (c) separating the population to determine the nucleotide sequence of all or 
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a portion ofthe molecule to be semifnrpH ti,~;„ 

c to oe sequenced. The invention more specifically relates 

to a method of sequencing a DNA molecule, comprising. 

(a) hybridizing a primer to a first DNA molecule; 

(b) contacting said molecule of step (a) with deoxyribonucleoside 
triphosphates, one or more compounds or compositions of the 
invention, and one or more terminator nucleotides; 

(c) incubating the mixture of step (b) under conditions sufficient to 
synthesize a random population of DNA molecules complementary to 
said first DNA molecule, wherein said synthesized DNA molecules are 
shorter in length than said first DNA molecule and wherein said 
synthesized DNA molecules comprise a terminator nucleotide at their 
3 ' termini; and 

(d) separating said synthesized DNA molecules by size so that at least a 
part of the nucleotide sequence of said first DNA molecule can be 

determined. 

Such terminator nucleotides include ddNTP, ddATP, ddGTP, ddlTP or ddCTP 
Such conditions may include incubation in the presence of one or more DNA 
polymerases and/or buffering salts. 

The invention ako relates ,o kits for use in synthesis of a nucleic ^ 
molecule, comprising one or more containers con.ain.ng one or more of ,he 
compounds or composes of the invention. These kits of the Motion may 
opnonaUy comprise one or more additional components selected from the group 
cons.st.ng of one or more nucleotides, one or more polymerases and/or reverse 
transcriptases, a sui,ab,e buffer, one or more primers and one or more terminating 
agents (such as one or more dideoxynucleotides). 

Other preferred embodiments of the present invention will be apparent to 
one of ordinal skill ,„ light of the following drawings and description of*, 
invention, and of the claims. 
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BRJEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a photograph of an ethidium bromide-stained agarose gel of 
samples of PCR reaction mixtures amplified in the presence of the indicated 
concentrations of proline, 1-methylimidazole, 4-methylimidazole, betaine, or 
none of these cosolvents. M: DNA sizing markers. 

Figure 2 is a photograph of an ethidium bromide-stained agarose gel of 
samples of PCR reaction mixtures amplified in the presence of the indicated 
concentrations of betaine or MMNO. M: DNA sizing markers. 

Figure 3 is a photograph of an ethidium bromide-stained agarose gel of 
samples of amplifications of three different Pseudomonas aeruginosa 
amplicons (AprD, AprE, and AprF) in the presence or absence of various 
combinations of compounds. Lanes 1: 1 M betaine; lanes 2: 1 M TMANO; 
lanes 3-7: MMNO at 2 M (lanes 3), 1 M (lanes 4), 0.5 M (lanes 5), 0.4 M 
(lanes 6) or 0.2 M (lanes 7); lanes 8. no compound control. M: DNA sizing 
markers. 



Figure 4 is a photograph of an ethidium bromide-stained agarose gel of 
samples of PCR amplification of p53 exon 10 in the presence or absence of the 
indicated concentrations of betaine, MMNO, or proline, under different 
reaction buffer conditions. 

Figure 5 is a photograph of an ethidium bromide-stained agarose gel of 
samples of PCR amplification of Dra DNA polymerase I in the presence or 
absence of the indicated concentrations of betaine, MMNO, or proline, under 
different reaction buffer conditions. 

Figure 6 is a photograph of an ethidium bromide-stained agarose gel of 
samples of PCR amplification of p53 exon 10 in the presence or absence of 
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mixtures of MMNO and proline at different ratios, or in the presence of 
MMNO, proline, or betaine alone, under different reaction buffer conditions 
(Mg ++ concentrations). 

Figure 7 is a photograph of an ethidium bromide-stained agarose gel of 
samples of PCR amplification of Dra DNA polymerase I in the presence or 
absence of mixtures of MMNO and proline at different ratios, or in the 
presence of MMNO, proline, or betaine alone, under different reaction buffer 
conditions (Mg ++ concentrations). 

Figure 8 is a photograph of an ethidium bromide-stained agarose gel of 
samples of PCR amplification of the GC-rich P32D9 template demonstrating 
the effects of mixtures of MMNO and proline, or of betaine, on annealing 
temperature optima. 

Figure 9 is a photograph of an ethidium bromide-stained agarose gel of 
samples of PCR amplification of the Fragile X locus from genomic DNA of the 
K562 cell line in the presence of various concentrations of either betaine or of 
1:1 mixtures of MMNO and proline. Lanes 1: no cosolvent; lanes 2: 0.25M; 
lanes 3: 0.5M; lanes 4: 0.75M; lanes 5: 1 M; lanes 6: 1.25 M; lanes 7: 1.5 M; 
lanes 8: 1 .75 M; lanes 9: 2 M. M: DNA sizing markers. 

Figure 10 is a photograph of an ethidium bromide-stained agarose gel 
of samples of PCR amplification of two different long GC-rich adenovirus 
DNA fragments in the presence or absence of different concentrations of 1 : 1 
mixtures of MMNO and proline. Lanes 1 : no cosolvent; lanes 2: 0.25 M; lanes 
3: 0.5 M; lanes 4: 1.0 M. M: DNA sizing markers. 

Figure 11 is a photograph of an ethidium bromide- stained agarose gel 
of samples of PCR amplification of GC-rich fragments of K562 genomic DNA 
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m the presence or absence of various concentrations of 1 : 1 mixtures of MMNO 
and proline (lanes A), betaine (lanes B), L-carnitine (lanes C) or DL-pipecolic 
acid (lanes D). Lanes 1: no cosolvent; lanes 2: 0.25 M; lanes 3: 0.5 M; lanes 4: 
1 M; lanes 5: 1.5 M; lanes 6: 2 M. M: DNA sizing markers. 

Figure 12 is a photograph of an ethidium bromide-stained agarose gel 
of samples of PCR amplification of GC-rich fragments of K562 genomic DNA 
in the presence or absence of various concentrations of betaine (lanes A) or 
ectoine (lanes B). Lanes 1 : no cosolvent; lanes 2: 0.25 M; lanes 3: 0.5 M; lanes 
4: 1 M; lanes 5: 1.5 M; lanes 6: 2 M. M: DNA sizing markers. 

Figure 13 shows the structures of a number of example compounds 
that may be used in accordance with the invention. 



DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

In the description that follows, a number of terms used in recombinant 
DNA technology are utilized extensively. In order to provide a clearer and 
consistent understanding of the specification and claims, including the scope to be 
given such terms, the following definitions are provided. 

Library. As used herein, the term "library" or "nucleic acid library" means 
a set of nucleic acid molecules (circular or linear) representative of all or a 
significant portion of the DNA content of an organism (a "genomic library"), or 
a set of nucleic acid molecules representative of all or a significant portion of the 
expressed genes (a "cDNA library") in a cell, tissue, organ or organism. Such 
libraries may or may not be contained in one or more vectors. 

Vector. As used herein, a "vector" is a plasmid, cosmid, phagemid or 
phage DNA or other DNA molecule which is able to replicate autonomously in a 
host cell, and which is characterized by one or a small number of restriction 
endonuclease recognition sites at which such DNA sequences may be cut in a 
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determinable fashion without loss of an essential biological function of the vector 
and into which DNA may be inserted in order to bring about its replication and 
cloning. The vector may further contain a marker suitable for use in the 
identification of cells transformed with the vector. Markers, for example, include 
but are not limited to tetracycline resistance or ampicillin resistance. 

Primer. As used herein, "primer" refers to a single-stranded 
oligonucleotide that is extended by covalent bonding of nucleotide monomers 
during amplification or polymerization of a DNA molecule. 

Template. The term "template" as used herein refers to double-stranded 
or single-stranded nucleic acid molecules which are to be amplified, synthesized 
or sequenced. In the case of a double-stranded molecules, denaturation of its 
strands to form a first and a second strand is preferably performed before these 
molecules may be amplified, synthesized or sequenced, or the double stranded 
molecule may be used directly as a template. For single stranded templates, a 
primer, complementary to a portion of the template is hybridized under 
appropriate conditions and one or more polymerases may then synthesize a nucleic 
acid molecule complementary to all or a portion of said template. Alternatively, 
for double stranded templates, one or more promoters (e.g. SP6, T7 or T3 
promoters) may be used in combination with one or more polymerases to make 
nucleic acid molecules complementary to all or a portion of the template The 
newly synthesized molecules, according to the invention, may be equal or shorter 
in length than the original template. 

Incorporating. The term "incorporating" as used herein means becoming 
a part of a DNA and/or RNA molecule or primer. 

Amplification. As used herein "amplification" refers to any /„ vitro 
method for increasing the number of copies of a nucleotide sequence with the use 
of a polymerase. Nucleic acid amplification results in the incorporation of 
nucleotides into a DNA and/or RNA molecule or primer thereby forming a new 
molecule complementary to a template. The formed nucleic acid molecule and its 
template can be used as templates to synthesize additional nucleic acid molecules 
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As used herein, one amplification reaction may consist of many rounds of 
replication. DNA amplification reactions include, for example, polymerase chain 
reactions (PGR). One PGR reaction may consist of 5 to 100 "cycles" of 
denaturation and synthesis of a DNA molecule. 

Oligonucleotide. "Oligonucleotide" refers to a synthetic or natural 
molecule comprising a covalently linked sequence of nucleotides which arejoined 
by a phosphodiester bond between the 3' position of the deoxyribose or ribose of 
one nucleotide and the 5' position of the deoxyribose or ribose of the adjacent 
nucleotide. Nucleotide. As used herein "nucleotide" refers to a base-sugar- 
phosphate combination. Nucleotides are monomeric units of a nucleic acid 
sequence (DNA and RNA). The term nucleotide includes ribonucleoside 
triphosphates ATP, UTP, CTG, GTP and deoxyribonucleoside triphosphates such 
as dATP, dCTP, dITP, dUTP, dGTP, dTTP, or derivatives thereof Such 
denvauves include, for example, [«S]dATP, 7-deaza-dGTP and 7-deaza-dATP 
and nucleotide derivatives that confer nuclease resistance on the nucleic acid 
molecule containing them. The term nucleotide as used herein also refers to 
dideoxynbonucleoside triphosphates (ddNTPs) and their derivatives Illustrated 
examples of dideoxynbonucleoside triphosphates include, but are not limited to 
ddATP, ddCTP, ddGTP, ddlTP, and ddTTP. According to the present invention' 
a "nucleotide" may be unlabeled or detectably labeled by well known techniques' 
Detectable labels include, for example, radioactive isotopes, fluorescent labels 
chemiluminescent labels, bioluminescent labels and enzyme labels. 

Hybridization. The terms "hybridization" and "hybridizing" refers to base 

P^nngoftwocomplementarysingle-strandednucleicacidmoleculesCRNAand/or 
DNA) to give a double-stranded molecule. As used herein, two nucleic acid 
molecules may be hybridized, although the base pairing is not completely 
complementary. Accordingly, mismatched bases do not prevent hybridization of 
two nucleic acid molecules provided that appropriate conditions, well known in 
the art, are used. 
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Unit. The term "unit" as used herein refers to the activitv of 

When referring for examnlo ♦ .u * *" CnZyme 

s ' Ior exa mple, to a thermostable DMA ™i 

activityistheamountofenzymethat W„ ■ " ' ^ ° f 

acid in™, k, incorporate 1 0 nanomoles ofdNTPs into 

acid-insoluble material (i.e. DNA or RNA^ ; ™ • u ^sinto 

Other terms used in the fields nf r»„ u- 

molecular and cell bioW 7u DNA ™« 

^ cell biology as used herein will be eenmii,, ^ 

<**»* sfcil! in the app,icab,e an, * " y °" e ° f 



Overview 



The present invention relates aenerallv t„ „ 
compositions comnric,™ provides compounds and 

compositions are used Th, ° 3 ° f SUCh c ™P°unds or 

uns are used. The compounds or compositions of th„ ; 
in the form of a salt. invention may be 



Formula I: 
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wherein A is I 

("4)d CR, 



wherein X is I 



(CRe) m (Y) ( 



wherein Z may be the same as or different from Y, 

wherein each Y and Z are independently selected from the group 
consisting of -OH, -NH 2 , -SH, -P0 3 H, -C0 2 H, -S0 3 H and hydrogen 
an mteger from 0 to 2, m is an integer from 0 to 20 and e is an integer 

from 0 to 2; 

wherein Rj, Rj and may be the same or different Md „ jndep 
selected from the group consisting of hydrogen, alky,, alkenyl, alkyny, ar y, 
-no .h,ol, mercaptan, halo, ni.ro, „itri, 0 , hydroxy, hydroxy, hy^,' 
P osphato, alkoxy, oxide, ether, ester (alkanoyloxy), carboxy carbony,, sulfj,' 
sulfomc and amido groups, and d is an integer from 0 to 2; 

wherein a, b, and c are independently an integer fr„ m 0 to ! tne 
provtso that no more than two of a, b, and c are zero; 

select dT er T" ^ ^ R " ^ " M *»» - « ^P^ntly 

selected from the group consisting of: 

a) =-0; 
« (W). 



-(CR,)„ ; 
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P-osp.a.o, a, toxy , oxide , e(heri ^ ; ^> 

" Z' T su,fony '' sulfomc ' Md — — • - - 

from 0 to 2, and n , s an integer from 0 to 20; and 



wherein q may be 1 to 100,000. 



In compounds of formula I when a = i tho 

' en ^ ~ J > the compound of formula I m*,, k 
con S1 dered a monomer, and when q = 2 to 100 000 th. 

maybeconsideredamultixneroraool compound of formula I 

which may each have ^"'^"^^to^™**^ 

connected J ^ " ^ ^ -y be 

connected y one or more bonds through one or more groups to formamultmer 
(e.g., a polymer) of the compound of formula I. 

In a preferred aspect, when a, b or c is zero th» 
is a pair of electrons. ' * »°up 

In another preferred embodiment, when q=l and one of (R l} (R) and 
» =0 and the other two R groups are the Same or diffe en ^ 
independently selected from th. aitterent and are 
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Formula II: 



(Ri) ; 



(Rs) t 



I 

(CRs), 



m 



(R 3 )< 




wherein Formula II is saturated 



or unsaturated 



wherein q may be 1 to 100,000; 



wherein X is selected from the 



group consisting of N, C, O, P and S- 



-NH-C(CH 3 ) 2 -NH- -NH-CH 9 * \ CH (CH 3 )-0-, 
alkoxv nvH u ° ther merca P ta »> Phosphato 

^r;::: etter ' ^ — j 

m, k k Rs , r^, R , and R , may be ^ or 



-22- 



wherein a b c H ™ 

0 to 5 for m , n> and Q _ ^ t0 2 for a > b > * d and e, 



In compounds of formula II when o - i ,k 

- co^ red a monomer , Md when 

formula II may be considered „ . . ' 100,000, the compound of 

— — )-ecom P ou nd :::i:;r sroups ,o f °™ a 

In one preferred aspect of the invention V „„h/ v 

— I- '-otherpreferredaspec, Vand/ X I M , ^'^"'"•^ 

y, wnen a, b, c, d and/or e are 7em tu~ 
^oup is a pair of electrons or involved in ft 7 ° m ^8 R 

structures. * * f ° rmation «f the unsaturated 



For compounds of formulae I and II; 



typical C 6 . 14 ary i gro inc , 

naphthy,, phenantfcy, '^TT ^ 
biptayleny, and fluoreny, groups; "* ta ^ 

- m „::::i:: ^ mciud * bw - - m — - — — * 

- Ranched chain afcy, ^ ^ ' ** «""*• - «■ 

^^ssir, but " n °' ,,mited <° ~* 

- - - - - - +^zz£r* ^ ^ 

typical C 2 .„ alkynyl srou { , 
h-ynyl, hep,y„ yl , oc(yny , ' " ^ fW*l butyny,, pe„ tynyl> 

bached chain aiityny, ' deCy " y ' »»» - *• *• - W e„ as the 
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J y ^ oranched chain isomers thereof 
Compounds which may be used in a ■ 
saccharin • accordance with the invention include 

sacchandes, amino acids, and polyalcohols, and derivatives thereof F 
saccharides inCude but are not limited to oligosacchlT ' * 

- as trehalose, maltose, glucose , sucro T 2 

cellobiose, l evanbiosej ' ^ Xy, ° blose ' ^-obiose, 

alt™. , qu«obiose, 2-P-glucuronosylglucuronic acid allose 

altrose, galactose, gulose, idose, mannose, talose sorbito. . , ' 
arabitol. ' to1 ' levul °se, xylitol and 

Such amino acids may include but are not limits t , • 

invention. Examples include N-r3'- one v m *u n u . 

Examples of polyalcohols include but are no, limit*, t0 glycol ethv, 
glycol, polyethylene glycol and the life ^ ' 

Preferred compounds ofthe invention may include bu,are„„,r . , 
4-methylmorpholine N-oxide (MMNO) andNaLr „ '°' 

- .-methylimidazole, 2-me^lT ^ ^ C ~ S SUCh 
<-*^**^^ : . — ~e, betame 

arid 7 u ,■ um '' ,aunne . ec 'oine,pip e colinicacid pipecolir 

N-oxide, 3-me, hyBs oxazo,.5 ( 4H,-o„e morpholine sal, ! " ^ 

tae.hyl glycine> N . methy| . pro|ine " C ^ ^ S °— ■ N-N- 

~oxy„cacid, .--^L^TT 
acid 5-methvi -5 • uoxyjic acid, 2-pyrazinecarboxylic 

meth y , - 2 -Pyrazinecarboxylicacid 4 n^h, i < • •_, 

.-Mpyrrole-Wboxyltc a ,d Tj^^TT**™"** 

a, ethyl-3-methyhmidazolium nitrate, ethyl 
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azetidi„e-l.p ropionate N,N-dimethyl-phe„ v lalam„ B <! k 
^imida^dicarooxyhc acid, ca^tine ^ a b ™»°**° «* 

~ y d roxystachydrine , N „^ amo ;:;^ 7**-. -* 

N-ace^u^,^^ >. *»* *™«™. .hereof, 

•etrahydro-amethvl 4 nin, •„• me ' hy ' Sl ' ,f0mo P ro P i <'™e, ectoi„e(,,4,5,6- 

zmem y 1 -4-pxrymidine carboxilic ari/n i, ^ 
P-glutammine, oclopine> J" ^roxyecoine, ghuamate, 

W-^ S „e 0f ,Z- T M N "^ de (TMA0> ' 

« ^rriii;; 1 5o ' oo ° ,o abou < 

I « «™uia I. contains a carboxyl **" "* <™ ~ 

-h.de esters ^ am ides of Jcl ' ° f ' h ~" 
-tine methods of chemic/s^f ^ ^ ^ ^ ™ »** 

co mPOund itr^ r examp,e by condens,ng ,he ^ 

~ ta no, limttd to III; * mi c - 

pentano,, Md ^ 

™"° —Pounds useft,, according to this aspect of ,„ " 
amino compounds ^ c ^J. thlS ^ ° fthe '"vmion include C M 

- w- ^STJT™* incWns bm not "°* ed * 

chain isomers ,h lof T h - 

contains a hydroxy ,OH, I' T ' C ° mP0U " d ° f ^ ' « " 

/ v wi; group, the comnoundc nf+u * 

-ers of such compel which may „ e J™* ° f "» — -We the 
-ai„i» g compound w i<„, for exam , " ' 

— ^d^anoicacidorananh « ^T^^ 

Propionic acid, butyric acid „. , *' C add ' acetic 

•J™ acd, pen,a„o,c acid, hexanoic ^ md ^ ^ 
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apparen, t0 01K of ordinaty sffl ,„ J ™~ ~ ™« ^ 
knowledge in the an. gS C ° mained herei " and 



example by mixing a solution of the narrimi. ^ntnesis, for 

«* such as hydtochlotic ac , d ~ ™» a S o lutlon of an 

ac*, ctnc acid, tartaric acid, c^ZZ^lT- ' "** 

- - * «. w o;^t:;;:;:r d" " 

routae methods of chemical synthesis for , , u V ^ us,n S 
Particular compound wi th . 0 ,1„ k * 3 S0 ' U,i °" 

—urn Mro.de, chohn ^ ^ T " ^ 
O-^of.hecomponnd,^:;"^ ^ ^ * »* 

- - ;: ,he te a CW „ gs ^ herein and ~- sm 

a, fcast f our> at leas , ' ° f * ,ea St two, at teast 

'n a preferred aspect, 2 10 , „ ~ l o t TT" ^ " ^ 

- 3 of such compounds a re used. ,„ a e ^ * *° * * '° $ ' 2 » * - * 

"» compositions obtained b y ra w M ^ inVenti ° n '° 

compound, ,„ mixi„ s such * 7 C ° mbma ' ,0n ° f "» ** """ioned 

mixing such compounds together certain 

Pla- which may cha„ g e the 8tructure rf ^ „ ^ '~ take 
- ~ « in the f omata of _ „ d ~ ^ ~ - 

-erse Option J ^^™«^^0-*^PO t X 

-auencng methods. The invention also retaes to 
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nucleic acid molecules produced by these method ♦ r 

thereof ,nH ♦ ' ° fra « ment s or derivatives 

— .o Produ ^rr: o,h r ofsuchn " ddcac,d 

Synthesis Methods 



Stasis of co mp0lmds of fomu , a , may be ^ ^ fo|bws: 



(Ri) a 

(R 3 )c N (R 2)b 



+ 



Br 

I 



(A 



(R4) d — CRg— (CRg^ 



(Y) e 



(Ri)a 
(R 3 )o — W — (R 2 ) b 

(R4)d— CRg— (CRg^ 



in 



For example, when R 4 and Z are H Y is -CO H H f , 
^ starting chemical is BrCH CH CO H V ■ * " ^ ^ 

^H 2 CH 2 C0 2 H which , s commercially available. 

Synthesis of compounds of Formula II may be carried out as follows: 
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(R 2 ). 



(R 3 )c-i 



NH„— 



2— (CRs) m — (CR^i—c 



\ 



r(CH 3 ) 3 — c— 



R 3 

Ofc-C 



=0 



NaBK 




( R 2)b (R 3 ) c ., 




V 



f R 2)b (R 3 )< 



V 



R 3 

I 



(pRehii (CR 7 ) n . 1 -^-CH — 



OH 



NH — (CRshr-fCR,),,— OH 



(Rs)e 

Cl (CRe^-CI 




(R 2 ), 



Y 



J 



1) H + 

2) Base 



NH— (CR s ) sr - ( CR 7 ,_. 0 - 



(Rs) e 
I 

(CRaJo — CI 



(Rsk a ; R 2)h 

(CRsJo Vr^ 

° (CR 7 )n 

(R 3 ) c 



1) Rl 

2) Perbenzoic Acid 

(R5L /^[ R 2)b 

(CReJo (CRs) m 
° (CR 7 )n 

(R 3 ) c 
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For example, when R 2 and R are H h * a 
*u i 3 H > b and m are 1 ■ c-1 anw „ i 

then the starting chemical is NH 2 _ CH COR wu- ^ 

Also avails * 18 comme ™a% available 

cH 3 c Hj c„ (Br)COA H ; c ^ 

Louis, mo). compoimds may b = <*— *» ^ ; st ; 

Numerous compounds for use in th« • 

commercially for examm. f « ■WHytadaale) may be obtained 

y, lor example from S.gma ( St . Louis> MQ) 

To formulate the compositions „f , he imaitioa om 
described compounds may be mixed ,o e e,b ■ 0W " a "° Ve - 

a solution of eacb compoun ™" «~ed form, prepay 

— . * - - composi,,: iiizt - — *■ 

one compound and admixi„ g ,„ e po^^ of 8 3 **** "* 
compounds. ° rm ° f one or m ™ additional 

"elude, but are not limited ,„ „ , Polymerase activity may 

„avi„ g ^ m " ^ ^ 
-scriptase activity * ^ P °'~ «« 

: repreferab ^ s ; compositions 

Stabi,ity " - -d herein generally mean the ret ^ 

a compound or an enzyme of the * * C ° ra P 0nent » as 

enzyme of the composition, of at least 7no/ , 

' least 70/o > Preferably at least 
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80%, and most preferably al least 90% „f the oris™, 

«** after the ^position ^ ^ J ™ * ^ ^ °~ 

-*« ab „ u , s, montts al . , emp ::: ~ ^ -~ 

<* <e ra "working concentration- means the ^ "therein, 

-pound or m ^ tha , , „ q ~ ~n of a chemtca, 

solution to perform a particul, „ ' "ncentratton used in . 

P orm a parttcular functton (such as synthesis of nucleic acid^ 
Water which may be used i„ f ' 

■nven-on is preferab, y d,ll ^ 8 CO,nPOSi " 0nS rftt » 

y cicraoiy distilled, deionized and sterile a , 

micrometer filter), a„ d „ free of " *"* ffltered ("-ugh a 0.1-0.2 

Suc h „ a ,eri sav , lableMM " n ' amna ''7 b " se - >«ase e.ymes. 
L o„,s, r. ssouri) , or I t^t^^-^Company 

known t0 those skilled in the arl * aCC ° rdMS '° 

In addition to the chemical (and optionally polypeotidel 
P-n, compositions preferably comp.se one , ^ 2 T"' 
necessa^ for synthesis of a nucleic acid molecule Z ' 
for use in forming the nresen, ,„ • • p a"™Iarly preferred buffers 

Pbosphate or JL JZ^Z^^ 
-ough alternative buff^ f el (hy<, ~^— ^ «, 
WS. may be used wZ aPPrMinM,e ^ - ^ - 

-actor s, ts such J l: ;: re -; — * — s„s, 

-ate) are included in the compel ~ 

^i::r it:::: ::r t buto - — - - 

action of the chemical comlo T ^ " *° ^ <** - 

way, any pH-sen<!it^ k 

n optlonal, y the polypeptides). In this 

present compositions. ^adation ^ring formulation of the 

To formulate the buffered salts solution a buffer salt 

a salt of TrisChydroxvmethvn, • . ° h 18 P refer *Wy 

v yuroxymetnyl)aminomethane (TRl<i<fh\ anA 

UKli>®), and most preferably the 
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hydrochloride salt thereof ic ^ u- , 

1 "icreor, is combined with a <,„«• • 

«— (Prefer e , lner ^ ^ or J^*- a sal, of 

.oprov id ea„o r «„ gconcemra( , on "» «* *~-> may be added 

-^ar, «, most preferab , y 3_ 7 ° 5 f 2 , '•«•« 
~ y potassium ch , oride) J - - A - 0- 

'0,00 miliar a „ d mos( ^ 1 ^ °' U " M ' « * 
a 8 e« sucb as ditnio,^, may be " £ f ^ " A 
a <U concentrate of abou , , J" ^ '° *• pref e rably at 

about 5-50 mM „ r abmt 7 5 . 20 J ^ ^ **** a conc« n ,r a „o„ of 

mM, and most preferahlv a * 
about 10 mM. A small amount of a ^ af „ h , * " C0 " cen,ra <™ of 

«h a, disodium EDTA. mav ! T ""^^'^ (EDTA) 
-o ug » i„ clusion °< about o. 1 milnmolar^ 

«% of ,„e compos of r;ii:; be ^ ,o ,he a — « 

a"- «. buffered sa)t " PreSCM ,nVe "'™ A«e r addition of*, buffers 

preferably 8 0 to 9 0 * PH VaJue of 7 4 to 9 2 

y ° 9 °' Md most Preferably about 8 4 

- — * P-duce tte pleT, "~ - «M* 

In preferred compositions, ,„ e 

a»ou. 6:,, a bou,5..,, ' > *>"< 9 about 8:1, a bou, 7:1, 

"bom 15 1, about , 25 ., ' , , , t ' ^ 2 5 '■ a bout2:I, abou, 1.75:1, 

— about ,,, abou ' I ab ° U ' U 25 ' *» ":«.5. about ,,, 5 , 

' 400111 J -3, about 1:4 about i-« u 
*u. 1: 8 , abou, l: 9 , orabom 1: , 0 Moreprefcra «"»" about 1:7, 

a- a molar or stoichiometric rabo of ab ou, , , n ' C ° mP0UndS 
™ios m ay be determined b, routine opting ' T " S, ° iCW ° me,riC 
« - * form , he composWons ^ ' ^ antwocompounds 

nvention, of the amount of each 
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con,p„„ nd ^ ^ be optim . zed 

0-3-2.5 M, about 0.4-2 M, about 0.4-1 5 M about 0 4 1 M k ' 

Depends on the confounds used other!' " " °- 4 -°- 8 M 

on the desired result The co ^ ^ ^ ^ ^ used depending 

- to te ^"oT B-,M> ^ inVentl ° n ^ ^ bC St ° red * 

-Months oCth n 0 " 

ontnsat4C an dthreemonthsto ay earorlon fi erat 20°r • 

synthesis of nucleic acid molecules. C, unt,l use m the 

A variety of poIypeptides 
accordance with the present invention Included u " 

prases, Such po]ymerases j e ^ C 2—-^ 

TW*^ „^ 0 ^ a ^ DNA ^^DNApoly merase , 

DNA polymerase ' ymeraS6, 7 * n ***° ^ 

polymerase, 77,m« OCOCCMJ ///omfo 

/>^ OC c w ^^ rWD tJN1 ) DNA polymerase, 

ifju) DNA polymerase, DEEPVENT™ DNA ™i 
/> OCOC c«.v w 00 «7 DNA . DNA P^ymerase, 

polymerase * •// P ° lymerase ' ^cocca* sp KDD2 (KOD) DNA 

polymerase, 5aa// w , stenthermaphilus (Est) DNA nolv 

^ DNA polymerase Sulfa! J ' 
™i Polymerase, Sulfolobus acidocaldarius rSnr) fima 

polymerase, Thermoplasma acidophil (Too) DNA . 

r??WWy DNA po , ymerase n ( J DNA P oI ^se, 7*«,W 

~ polymerase, 77?erw^ /7> w i DNA ™i 

variants and denvatives thereof rma i 
such as T3 T5 anH <;p* , a hereof. RNA polymerases 

The nucleic acid polymerases used in the nr. ♦ • 
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DNA po,vmerases inciude T7 DNA po lym erase, T5 DNA p„, ymeras e, ^ 
fragment DNA pCymerase, DNA po.vmerase I„ a„ d ,„ e like . Preferred 
t he™„ st a W eDNApo, ym e ras es«ha tra a yb e„ S edi„ tt e ra e, h „ dsandcomposWom 

of ^e, n ve„ ti „„i„ clude a, ^ ^ % w ^ Stoffd 

7 ° NA - — , variants and deprives 

« U.S. Patent 5,079,352; U.S. Paten.5,6,4,365; us . Paten , . 
U.S. Patent 5,270,,79; U.S. Patent 5,047,342; U.S. Patent No. 5 5,2 462 WO 
2/06 I88; WO 9 2/„ 6 200 ; WO 96 ,0 6 40 ; B^s,W.M., Ce „ e .2, W5 0 9 ^ 

S F /; ***** 2 - 275 - 287 (i993); ^ j -h 

^ 2, ( ,5):32 59 -32 6 0 ( ,994». For ^ tion of lo „ g ^ 
ac d mo,ec„,es (e.g., „u C ,eic acid molecu|es |onger ^ ^ 3 

a. east two DNA po, y merases (one substanMJy ^ y ^ 
and the other having 3- exonuciease act,vitv) are , yp ica,, y used. &. us Patem 
No. 5,436, 49; U.S. Pa,e„, No. 5,5,2,462; Barnes, W.M., Ge„e „ 2 2 9. 35 

1*7. the disdo^ of which are i„corpor ated herein i„ t he,r enriret.es' 
Examp,e S of DNA po^erases substantia,,, , acldng in 3 . ^ 

™r n ;" Wtedt °' "* r " <eXO ) ' <- ).^o ) 

and T,H DNA po lym erases, and mutants, variants and derivari V es thereof 

Polypeptides hav,„g reverse transcriptase activity for use in the invention 
-ude polypep(jde havjng ^ trjnscripase ^ 

■nch.de, hut are no, hmited , 0 , retrovira, reverse transcriptase, retrotransposon 
reverse transcriptase, hepatitis B reverse transcnptase, cauMower mosaic virus 
-verse transcriptase, bacteria, reverse transcriptase, Tth DNA po lym erase, To, 

No, 4,8 9, 818 and 4,965,,S 8) , T„e DNA po,vmerase (WO 96/.0640), Tma 
DNA poivmerase (U. S. Patent No. 5,374,553) * mutants> variaMS Qr 

and 08/706,706, of A. John Hughes and Deh If rt, • . 

nugnes and Deb K. Chatterjee, both filed September 
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9, 1996, wh,ch areincorporated by referent herein fa their entireties) Preferred 
enzymes for use in the invention include those that are reduced or substantially 
reduced m RNase H activity. By an enzyme -substantial reduced ,„ RNase H 
ac^ty- is mean, ma , , ne enzyme ha$ ^ ^ ^ ^ ^ 

*» about ,5* I0K or 5%> and mos , preftrabIy ^ (han abou( ^ ^ 
«N» H activ.ty of the corresponding wiidtype or RNase „ * enzyme such as 

w,^eMo,o„eyMu„„eL e u k e m iaVi™s ( M-MLV, > A™Mye 1 ob I as,osisV I n J s 
(AMV) or Rous Sarcoma Virus (RSV) reverse transcriptases. The RNase H 
act™* of any en2yme may „ e fcy f ^ ^ ^ ^ ^ 

descnbed, for example, in U.S. Paten, No. 5,244,797, in Kotewicz, ML elal 

"ZTT ^ 16 265 " 988) ^ m ^ ° F ' * < ™ ^91 
0992) the disclosures of all of which are My incorporated herein by reference 

Partly preferred such polypeptides for use in the invention indude, bu, are 
« to, M-MLV H- reverse transcript, RSV „■ reverse transcriptase, 
AMV H reverse transcribe, RAV (Rous-associated virus) H' reverse 
-~>^V(mye,ob^^ 

HIV H reverse transcriptase. „ wi„ be understood by m rf 
however, that any enzyme capab!e of producing a DNA molecule from a' 
nbonuclcc acid molecule (,,, having reverse transcriptase activity) th a , is 
substantially reduced in RNase H activity ma y be uxd .„ 

compositions, methods and kits of the invention. 

com T T ^ P °' ymeraSeS *• mVMtto " •» 

ommerctally, for e X amp,e from Life Techn olog ie S , ,„c. (Rockville, Ma^iand) 

Perion-Elmer (Branchburg, New Jersey,, New England BioLabs (Beverly 

Massachusetts) or Boenringer Mamtheim Biochemicals (mdianapolis, 

Po ypephdes having reverse transcriptase activity for use in the invention may be 

Ma 1 ei ~ >My ' fr ° m ^ TeCh " 01 ^ - 

M^land), Pharmacia (Piscataway, New Jersey), Sigma (Sain, Louis, Missouri)' 

or Boehrmger Ma^theim Biochemicals (Indianapolis, mdiana). Ahematively 
po.ypept.des having reverse transcriptase activity may be isolated from J 
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natural viral or bacterial sources according to standard procedures for isolating 
and purifying natural proteins that are well-known to one of ordinary skill in the 
art (see, e.g., Houts, G.E., et al., J. Virol. 29:517 (1979)). In addition, the 
polypeptides havingreverse transcriptase activity may be prepared by recombinant 
DNA techniques that are familiar to one of ordinary skill in the art (see, e.g., 
Kotewicz, M.L., etal, Nucl. Acids Res. 16:265 (1988); Soltis, D.A., and Skalka,' 
A.M., Proc. Natl. Acad. Sci. USA £5:3372-3376 (1988)). 

Polypeptides having polymerase or reverse transcriptase activity are 
preferably used in the present compositions and methods at a final concentration 
in solution of about 0.1-200 units per milliliter, about 0.1-50 units per milliliter, 
about 0.1-40 units per milliliter, about 0.1-3.6 units per milliliter, about 0.1-3 4 ' 
units per milliliter, about 0.1-32 units per milliliter, about 0.1-30 units per 
milliliter, or about 0.1-20 units per milliliter, and most preferably at a 
concentration of about 20-40 units per milliliter. Of course, other suitable 
concentrations of such polymerases or reverse transcriptases suitable for use in the 
invention will be apparent to one or ordinary skill in the art. 

Methods of Nucleic Acid Synthesis 

The compounds and compositions of the invention may be used ^methods' 
for the synthesis of nucleic acids. In particular, it has been discovered that the 
present compounds and compositions facilitate the synthesis, particularly via 
amplification reactions such as the polymerase chain reaction (PCR), of nucleic 
acid molecules that have a high content of guanine and cytosine fa, "GC-rich" 
nucleic acid molecules). The present compounds and compositions may therefore 
be used in any method requiring the synthesis of nucleic acid molecules, such as 
DNA (particularly cDNA) and RNA (particularly mRNA) molecules. Methods 
in which the compounds or compositions of the invention may advantageously be 
used include, but are not limited to, nucleic acid synthesis methods, nucleic acid 

amplificationmethods,nucleicacidreversetranscriptionmethods,andnucleicacid 

sequencing methods. 
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Synthesis 

Nucleic acid synthesis methods according to this aspect of the invention 
may comprise one or more steps. For example, the invention provides a method 
for synthesizing a nucleic acid molecule comprising (a) mixing a nucleic acid 
template with one or more of the above-described compounds and compositions 
of the mvention to form a mixture; and (b) incubating the mixture under conditions 
sufficient to make a first nucleic acid molecule complementary to all or a portion 
of the template. According to this aspect of the invention, the nucleic acid 
template may be a DNA molecule such as a cDNA molecule or library, or an RNA 
molecule such as a mRNA molecule. 

In accordance with the invention, the input nucleic acid molecules or 
libraries may be prepared from populations of nucleic acid molecules obtained 
from natural sources, such as a variety of cells, tissues, organs or organisms. Cells 
that may be used as sources of nucleic acid molecules may be prokaryotic 
(bacterial cells, including those of species of the genera Escherichia, Bacillus 
Serratia, Salmonella, Staphylococcus, Streptococcus, Clostridium, Chlamydia 
Neisseria, Treponema, Mycoplasma, Borrelia, Legionella, Pseudomonas 
Mycobacterium, Helicobacter, Envinia, Agrobacterium, Rhizobium and 
Streptomyces) or eukaryotic (including fungi (especially yeasts), plants 
protozoans and other parasites, and animals including insects (particularly 
Drosophila spp. cells), nematodes (particularly Caenorhabditiselegans cells), and 
mammals (particularly human cells)). 

Mammalian somatic cells that may be used as sources of nucleic acid 
molecules or libraries of nucleic acid molecules include blood cells (reticulocytes 
and leukocytes), endothelial cells, epithelial cells, neuronal cells (from the central 
or penpheral nervous systems), muscle cells (including myocytes and myoblasts 
from skeletal, smooth or cardiac muscle), connective tissue cells (including 
fibroblasts, adipocytes, chondrocytes, chondroblasts, osteocytes and osteoblasts) 
and other stromal cells (e.g., macrophages, dendritic cells, Schwann cells) 
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Mammalian germ cells (spermatocytes and oocytes) may also be used as sources 
of nucleic acids or libraries for use in the invention, as may the progenitors, 
precursors and stem cells that give rise to the above somatic and germ cells. Also 
suitable for use as nucleic acid sources are mammalian tissues or organs such as 
those derived from brain, kidney, liver, pancreas, blood, bone marrow, muscle, 
nervous, skin, genitourinary, circulatory, lymphoid, gastrointestinal and connective 
tissue sources, as well as those derived from a mammalian (including human) 
embryo or fetus. 

Any of the above prokaryotic or eukaryotic cells, tissues and organs may 
be normal, diseased, transformed, established, progenitors, precursors, fetal or 
embryonic. Diseased cells may, for example, include those involved in infectious 
diseases (caused by bacteria, fungi or yeast, viruses (including HIV) or parasites), 
in genetic or biochemical pathologies (e.g. , cystic fibrosis, hemophilia, Alzheimer's 
disease, muscular dystrophy or multiple sclerosis) or in cancerous processes. 
Transformed or established animal cell lines may include, for example, COS cells, 
CHO cells, VERO cells, BHK cells, HeLa cells, HepG2 cells, K562 cells, F9 cells 
and the like. Other cells, cell lines, tissues, organs and organisms suitable as 
sources of nucleic acids for use in the methods of the present invention will be 
apparent to one of ordinary skill in the art. These cells, tissues, organs and 
organisms may be obtained from their natural sources, or may be obtained 
commercially from sources such as American Type Culture Collection (Rockville, 
Maryland) and others that are known to the skilled artisan. 

Once the starting cells, tissues, organs or other samples are obtained, 
nucleic acid molecules (such as DNA, RNA (e.g., mRNA or poly A+ RNA) 
molecules) may be isolated, or cDNA molecules or libraries prepared therefrom, 
by methods that are well-known in the art (See, e.g., Maniatis, T., et al, Cell 
15:687-701 (1978); Okayama, H., and Berg, P., Mol. Cell. Biol. 2:161-170 
(1982); Gubler, U., and Hoffman, B.J., Gene 25:263-269 (1983)). 

In the practice of this aspect of the invention, a first nucleic acid molecule 
may be synthesized by mixing a nucleic acid template obtained as described above, 
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which is preferably a DNA molecule such as a cDNA molecule, or an RNA 
molecule such as an mRNA molecule or a polyA+ RNA molecule, with one or 
more of the above-described compounds or compositions of the invention to form 
a mixture. Under conditions favoring the reverse transcription (in the case of an 
RNA template) and/or polymerization of the input nucleic acid molecule, synthesis 
of a first nucleic acid molecule complementary to all or a portion of the nucleic 
acid template is accomplished. Such synthesis is usually accomplished in the 
presence of nucleotides (e.g., deoxyribonucleoside triphosphates (dNTPs), 
dideoxyribonucleoside triphosphates (ddNTPs) or derivatives thereof). 

Alternatively, the compounds, compositions and methods of the invention 
may be used in single-tube synthesis of double-stranded nucleic acid molecules. 
In this approach, the first nucleic acid molecule synthesized as described above 
is incubated under conditions sufficient to make a second nucleic acid molecule 
complementary to all or a portion of the first nucleic acid molecule. This second 
strand synthesis may be accomplished, for example, by a modified Gubler- 
Hoffman reaction (D'Alessio, J.M., et al, Focus 9: 1 (1987)). 

Of course, other techniques of nucleic acid synthesis in which the 
compositions and methods of the invention may be advantageously used will be 
readily apparent to one of ordinary skill in the art. 

Amplification and Sequencing Methods 

In other aspects of the invention, the compositions of the invention may 
be used in methods for amplifying or sequencing nucleic acid molecules. Nucleic 
acid amplification methods according to this aspect of the invention may 
additionally comprise use of one or more polypeptides having reverse transcriptase 
activity, in methods generally known in the art as one- step (e.g., one-step RT- 
PCR) or two-step (e.g., two-step RT-PCR) reverse transcriptase-amplification 
reactions. For amplification of long nucleic acid molecules (i.e., greater than 
about 3-5 Kb in length), the compositions of the invention may comprise a 
combination of polypeptides having DNA polymerase activity, as described in 
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detail in commonly owned, co-pending U.S. Application No. 08/801,720, filed 
February 14, 1997, the disclosure of which is incorporated herein by reference in 
its entirety. 

Amplification methods according to this aspect of the invention may 
comprise one or more steps. For example, the invention provides a method for 
amplifying a nucleic acid molecule comprising (a) mixing a nucleic acid template 
with one or more of the above-described compounds or compositions of to form 
a mixture; and (b) incubating the mixture under conditions sufficient to amplify a 
nucleic acid molecule complementary to all or a portion of the template. The 
invention also provides nucleic acid molecules amplified by such methods. 

General methods for amplification and analysis of nucleic acid molecules 
or fragments are well-known to one of ordinary skill in the art (see, e.g., U.S. Pat. 
Nos. 4,683,195; 4,683,202; and 4,800,159; Innis, M.A., et al., eds., PCR 
Protocols: A Guide to Methods and Applications, San Diego, California: 
Academic Press, Inc. (1990); Griffin, H.G., and Griffin, A.M., eds., PCR 
Technology: Current Innovations, Boca Raton, Florida: CRC Press (1994)). For 
example, amplification methods which may be used in accordance with the present 
invention include PCR (U.S. Patent Nos. 4,683,195 and 4,683,202), Strand 
Displacement Amplification (SDA; U.S. Patent No. 5,455,166; EP 0 684 315), 
and Nucleic Acid Sequence-Based Amplification (NASBA; U.S. Patent No. 
5,409,818; EP 0 329 822). 

Typically, these amplification methods comprise contacting the nucleic acid 
sample with a compound or composition (such as those of the present invention) 
comprising one or more polypeptides having nucleic acid polymerase activity in 
the presence of one or more primer sequences, amplifying the nucleic acid sample 
to generate a collection of amplified nucleic acid fragments, preferably by PCR or 
equivalent automated amplification technique, and optionally separating the 
amplified nucleic acid fragments by size, preferably by gel electrophoresis, and 
analyzing the gels for the presence of nucleic acid fragments, for example by 
staining the gel with a nucleic acid-binding dye such as ethidium bromide. 
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Following amplification by the methods of the present invention, the 
amplified nucleic acid fragments may be isolated for further use or 
characterization. This step is usually accomplished by separation of the amplified 
nucleic acid fragments by size by any physical or biochemical means including gel 
electrophoresis, capillary electrophoresis, chromatography (including sizing, 
affinity and immuno chromatography), density gradient centrifugation and 
immunoadsorption. Separation of nucleic acid fragments by gel electrophoresis 
is particularly preferred, as it provides a rapid and highly reproducible means of 
sensitive separation of a multitude of nucleic acid fragments, and permits direct, 
simultaneous comparison of the fragments in several samples of nucleic acids. 
One can extend this approach, in another preferred embodiment, to isolate and 
characterize these fragments or any nucleic acid fragment amplified by the 
methods of the invention. Thus, the invention is also directed to isolated nucleic 
acid molecules produced by the amplification or synthesis methods of the 
invention. 

In this embodiment, one or more of the amplified nucleic acid fragments 
are removed from the gel which was used for identification (see above), according 
to standard techniques such as electroelution or physical excision. The isolated 
unique nucleic acid fragments may then be inserted into standard nucleotide 
vectors, including expression vectors, suitable for transfection or transformation 
of a variety of prokaryotic (bacterial) or eukaryotic (yeast, plant or animal 
including human and other mammalian) cells. Alternatively, nucleic acid 
molecules that are amplified and isolated using the compounds, compositions and 
methods of the present invention may be further characterized, for example by 
sequencing (i.e., determining the nucleotide sequence of the nucleic acid 
fragments), by methods described below and others that are standard in the art 
(see, e.g., U.S. Patent Nos. 4,962,022 and 5,498,523, which are directed to 
methods of DNA sequencing). 

Nucleic acid sequencing methods according to the invention may comprise 
one or more steps. For example, the invention provides a method for sequencing 
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a nucleic acid molecule comprising (a) mixing a nucleic acid molecule to be 
sequenced with one or more primers, one or more of the above-described 
compounds or compositions of the invention, one or more nucleotides and one or 
more terminating agents (such as a dideoxynucleotide) to form a mixture; (b) 
incubating the mixture under conditions sufficient to synthesize a population of 
molecules complementary to all or a portion of the molecule to be sequenced; and 
(c) separating the population to determine the nucleotide sequence of all or a 
portion of the molecule to be sequenced. 

Nucleic acid sequencing techniques which may employ the present 
compositions include dideoxy sequencing methods such as those disclosed in 
U.S. Patent Nos. 4,962,022 and 5,498,523. 

Vectors and Host Cells 

The present invention also relates to vectors which comprise the isolated 
nucleic acid molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and methods for the production of a 
recombinant polypeptide using these vectors and host cells. 

The vector used in the present invention may be, for example, a phage or 
a plasmid, and is preferably a plasmid. Preferred are vectors comprising 
c«-acting control regions to the nucleic acid encoding the polypeptide of interest. 
Appropriate tram-acting factors may be supplied by the host, supplied by a 
complementing vector or supplied by the vector itself upon introduction into the 
host. 

In certain preferred embodiments in this regard, the vectors provide for 
specific expression of a polypeptide encoded by the nucleic acid molecules of the 
invention; such expression vectors may be inducible and/or cell type-specific. 
Particularly preferred among such vectors are those inducible by environmental 
factors that are easy to manipulate, such as temperature and nutrient additives. 

Expression vectors useful in the present invention include chromosomal-, 
episomal-and virus-derived vectors, e.g., vectors derived from bacterial plasmids 
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or bacteriophages, and vectors derived from combinations thereof, such as 
cosmids and phagemids. 

The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda P L promoter, the E. coli lac, trp and tac promoters. 
Other suitable promoters will be known to the skilled artisan. The gene fusion 
constructs will further contain sites for transcription initiation, termination and, in 
the transcribed region, a ribosome binding site for translation. The coding portion 
of the mature transcripts expressed by the constructs will preferably include a 
translation initiation codon at the beginning, and a termination codon (UAA, 
UGA or UAG) appropriately positioned at the end, of the polynucleotide to be 
translated. 

The expression vectors will preferably include at least one selectable 
marker. Such markers include tetracycline or ampicillin resistance genes for 
culturing in E. coli and other bacteria. 

Among vectors preferred for use in the present invention include pQE70, 
pQE60 and pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, 
Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, available from 
Stratagene; pcDNA3 available from Invitrogen; and pGEX, pTrxfus, pTrc99a, 
pET-5, pET-9, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. 
Other suitable vectors will be readily apparent to the skilled artisan. 

Representative examples of appropriate host cells include, but are not 
limited to, bacterial cells such as E. coli, Streptomyces spp., Erwinia spp., 
Klebsiella spp. and Salmonella typhimurium. Preferred as a host cell is E. coli, 
and particularly preferred are E. coli strains DH1 OB and Stbl2, which are available 
commercially (Life Technologies, Inc; Rockville, Maryland). 

Peptide Production 

As noted above, the methods of the present invention are suitable for 
production of any polypeptide of any length, via insertion of the above-described 
nucleic acid molecules or vectors into a host cell and expression of the nucleotide 
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sequence encoding the polypeptide of interest by the host cell. Introduction of the 
nucleic acid molecules or vectors into a host cell to produce a transformed host 
cell can be effected by calcium phosphate transfection, DEAE-dextran mediated 
transfection, cationic lipid-mediated transfection, electroporation, transduction, 
infection or other methods. Such methods are described in many standard 
laboratory manuals, such as Davis et al, Basic Methods In Molecular Biology 
(1986). Once transformed host cells have been obtained, the cells may be 
cultivated under any physiologically compatible conditions of pH and temperature, 
in any suitable nutrient medium containing assimilable sources of carbon, nitrogen 
and essential minerals that support host cell growth. Recombinant polypeptide- 
producing cultivation conditions will vary according to the type of vector used to 
transform the host cells. For example, certain expression vectors comprise 
regulatory regions which require cell growth at certain temperatures, or addition 
of certain chemicals or inducing agents to the cell growth medium, to initiate the 
gene expression resulting in the production of the recombinant polypeptide. Thus, 
the term "recombinant polypeptide-producing conditions," as used herein, is not 
meant to be limited to any one set of cultivation conditions. Appropriate culture 
media and conditions for the above-described host cells and vectors are well- 
known in the art. Following its production in the host cells, the polypeptide of 
interest may be isolated by several techniques. To liberate the polypeptide of 
interest from the host cells, the cells are lysed or ruptured. This lysis may be 
accomplished by contacting the cells with a hypotonic solution, by treatment with 
a cell wall-disrupting enzyme such as lysozyme, by sonication, by treatment with 
high pressure, or by a combination of the above methods. Other methods of 
bacterial cell disruption and lysis that are known to one of ordinary skill may also 
be used. 

Following disruption, the polypeptide may be separated from the cellular 
debris by any technique suitable for separation of particles in complex mixtures. 
The polypeptide may then be purified by well known isolation techniques. 
Suitable techniques for purification include, but are not limited to, ammonium 
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sulfate or ethanol precipitation, acid extraction, electrophoresis, 
immunoadsorption, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity 
chromatography, immunoaffinity chromatography, size exclusion chromatography, 
liquid chromatography (LC), high performance LC (HPLC), fast performance LC 
(FPLC), hydroxylapatite chromatography and lectin chromatography. 

Kits 

The present invention also provides kits for use in the synthesis, 
amplification, or sequencing of a nucleic acid molecule. Kits according to this 
aspect of the invention may comprise one or more containers, such as vials, tubes, 
ampules, bottles and the like, which may comprise one or more of the 
compositions of the invention. 

The kits of the invention may comprise one or more of the following 
components: (i) one or more compounds or compositions of the invention, (ii) 
one or more polymerases or reverse transcriptases, (iii) one or more suitable 
buffers, (iv) one or more nucleotides, and (v) one or more primers. 

It will be readily apparent to one of ordinary skill in the relevant arts that 
other suitable modifications and adaptations to the methods and applications 
described herein are obvious and may be made without departing from the scope 
of the invention or any embodiment thereof. Having now described the present 
invention in detail, the same will be more clearly understood by reference to the 
following examples, which are included herewith for purposes of illustration only 
and are not intended to be limiting of the invention. 



Examples 

Introduction 

4-methylmorpholine N-oxide (hereinafter referred to as "MMNO") 
was tested on a number of different PGR amplicons containing high GC content 
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(e.g. CAG repeats, Pseudomonas genomic DNA, etc.). The amplicons tested 
were difficult to amplify using current standard PCR reaction mixtures. To test 
the effectiveness of novel cosolvents on PCR performance from high GC-content 
amplicons, MMNO and other cosolvents were added to PCR reaction mixtures at 
different concentrations. 



Methods 

The following sections describe the preparation of the chemicals 
used in the examples. 

Preparation of 2. 2X PCR mixture (Examples 1-3): 

In Examples 1 -3, A 2.2X PCR mixture was prepared containing all 
the components listed below except the template DNA and primers. The 
following table illustrates how to prepare a 2.2X PCR mixture. 



Stock 



Solution 



IPX PCR Buf fer 
5P mM MgCl- 

lOmMdN TPs 

Tween 2P 

Nonidet P-4P 

Taq Polymerase 
5 units/uF 



dH,Q 



Vol. Added 

1.1 ml 
P.33 ml 
P.22 m l 
55 ul 
55 ul 
44 ul 

To 5 ml 



Final 
Co ncentr ation 

2.2X 

3.3 mM 

0.44 m M 

0.11% 

0.11% 

44 units/ml 



Materials for Examples 1-9: Betaine monohydrate 
([Carboxymethyl]trimethyl-ammonium), L-proline, 4-methylmorpholine-4-oxide 
(MMNO), ectoine (THP[B] ; [S]-2-Methyl-l,4,5,6-tetrahydropyrimidine-4- 
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carboxylic acid), DL-pipecolic acid (DL-2-Piperidinecarboxylic acid), and L- 
carnitine ([-]-b-Hydroxy-g-[trimethylammonio]butyrate) were purchased from 
Sigma (St. Louis, MO), and prepared as 4M stock solutions in sterile distilled 
water and filter sterilized. PCR reagents: Platinum Taq DNA polymerase, 
Platinum Taq DNA polymerase High Fidelity, 1 OX Taq Buffer (1 X = 20 mM Tris- 
HC1 (pH8.4), 50 mM KC1), 50 mM magnesium chloride, 10X Taq High Fidelity 
Buffer (IX = 60 mM Tris-S0 4 (pH 8.9), 18 mM (NH 4 ) 2 S0 4 , 50 mM magnesium 
sulfate, 10 mM dNTP Mix, K562 human genomic DNA, and sterile distilled water 
were obtained from Life Technologies, Inc. (Rockville, MD). Oligonucleotide 
primers were purchased as desalted preparations from Life Technologies, Inc. and 
were used without further purification. 



Example 1: Titration of Proline, l-methylimidazoleand 4-methylimidazole 

to Improve PCR Amplification from a GC-rich Template 

Several chemicals were tested to see if they would improve PCR 
performance using a GC-rich template. In the first example, the GC-rich template 
P55G12 was tested with various concentrations of one of 3 different chemicals: 
the amino acid proline, 1- methylimidazole and 4-methylimidazole. The following 
components were combined in a 0.2 ml tube: 13 ml of the 2.2X PCR mix, 0.5 ml 
of template DNA (10 pg), 0.5 ml of a primer mix (10 mM each) and 13 ml of a 
4M chemical solution (either proline, 1 -methylimidazole or 4-methyl- imidazole), 
and the solution was mixed by pipeting. The program for PCR was: 95 °C, 3 min; 
30 cycles of 94 °C, 30 sec; 55 °C, 30 sec; 72 °C, 1 min. After the PCR was done, 
5 ml of loading dye was added to each tube and 12 ml of the mixture was loaded 
onto an agarose gel for electrophoresis followed by ethidium bromide staining of 
the gel for the presence of DNA fragments. 

As shown in Figure 1, certain concentrations of each of these three 
chemicals performed better than others. In the case of proline, 300 to 600 mM 
gave optimal amplification of P55G12, whereas concentrations higher than 600 
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mix. 



mM gave no product. In the case of 1 -methylimidazole, 1 00 and 200 mM worked 
best but higher or lower concentrations either did not work at all or produced 
much less product. In the case of 4-methylimidazole, a slightly lower range of 
concentration improved amplification: 60 to lOOmM. Note that without the 
addition of these compounds to the PCR reaction there was no amplification 
product, and that 1M Betaine was effective at getting the reaction to be 
productive. 

Example 2: Titration of 4-methylmorpholine N-oxide (MMNO) for the 

PCR of a GC-rich Template: Comparison of MMNO and 
Betaine 

In Example 1, MMNO was identified as a novel reagent for improving 
PCR performance on GC-rich templates. The next issue to be addressed was the 
dependence of the performance of MMNO on its concentration in the PCR 
The GC-rich amplicon P55G12 was amplified as described above in Example 1, 
in the presence of 1M Betaine or of various concentrations of MMNO. 

As shown in Figure 2, inclusion of 400 to 1000 mM MMNO in the PCR 
reaction mix resulted in the production of a PCR product that was comparable in 
intensity to that of 1M Betaine. Concentrations of MMNO lower than 400 mM 
did not result in the production of PCR product. 

Example 3: PCR Amplification of High GC-content Amplicons: 

Pseudomonas aeruginosa Amplicons 

The genome of Pseudomonas aeruginosa contains high GC-content (70% 
GC) and is therefore challenging to amplify by PCR. Therefore, three different 
Pseudomonas aeruginosa amplicons ranging in size from 1.3- to 1.8-kb were 
tested in the PCR methods of the invention using betaine, MMNO, or 
trimethylamine N-oxide (TMANO) in the PCR reaction mixture. The same 
reaction preparation was used as described in Example 1, except that the template 
DNA in these reactions consisted of 30 ng of P. aeruginosa genomic DNA. The 
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following PCR program was used for DNA amplification: 95 °C, 5min; 35 cycles 
of 94 °C, 2 min, 58 °C, 30 sec and 72 °C, 2min. 

As shown in Figure 3, the Pseudomonas aeruginosa sequence D was 
amplified in the presence of betaine or MMNO, whereas sequences E and F were 
amplified in the presence of betaine, MMNO or TMANO. These results indicate 
that long, natural GC-rich sequences, such as those from the genome of 
P. aeruginosa, can be efficiently amplified using the compositions and methods of 
the present invention. 



Example 4: Comparison of Betaine, Proline and MMNO for Enhanced 

PCR Amplification of GC-rich templates 

Varying concentrations of betaine, proline and MMNO were examined for 
their efficacy of enhancing PCR amplification of a 1 56-bp fragment of human p53 
exon 10 (62.2% GC) or the 2782-bp coding region for DNA polymerase I gene 
from Deinococus radiodurans (66.7% GC). Reaction parameters were varied to 
assess effects of buffer composition and magnesium concentration. 

PCR amplifications were performed using thin-walled 0.2-ml tubes in 
50 ml reactions containing 2.5 U Platinum Taq, either IX Taq Buffer (20 mM 
Tris-HCl (pH8.4), 50 mM KC1) or IX Taq High Fidelity Buffer (60 mM Tris-S0 4 
(pH 8.9), 18 mM (NH 4 ) 2 S0 4 ), 200 mM of each dNTP, and 200 nM of each 
primer. Magnesium concentration, either magnesium chloride (Taq Buffer 
reactions) or magnesium sulfate (Taq High Fidelity Buffer reactions), was varied 
between 1.0 and 2.5 mM. The amount of each cosolvent (betaine, MMNO, or 
proline) was varied as indicated in each figure. Reactions were temperature cycled 
using either a Perkin Elmer model 9600 or 2400 Thermal cycler. For amplification 
of human p53 exon 10 sequence, reactions contained 100 ng K562 human 
genomic DNA and were incubated at 95°C for 1 min followed by 35 cycles of: 
denaturation at 95 °C for 30s; annealing at 60°C for 30s, and extension at 68°C 
for 1 min. For amplification of the DNA pol I gene, reactions contained 20 ng 
Deinococcus radiodurans genomic DNA and were incubated at 95°C for 1 min 
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followed by 35 cycles of: denaturation at 95°C for 30s; annealing at 55°C for 30s, 
and extension at 68°C for 3 min. 10-ml of each PCR were analyzed by agarose 
gel electrophoresis and ethidium bromide staining for the presence of the expected 
DNA fragment. 

As shown in Figure 4, successful amplification of the 156-bp human p53 
sequence was dependent on magnesium concentration and specific buffer 
conditions. In the absence of added cosolvent, specific product was not detected 
in reactions containing standard PCR buffer and was detectable in PCRs with 
ammonium sulfate buffer ( Taq high fidelity buffer) only at 1 .0 mM MgS0 4 . The 
addition of betaine, MMNO, or proline cosolvent improved the specificity and 
yield of amplification product over a broader magnesium concentration in both 
buffer systems. The effect of MMNO on the range of optimal magnesium 
concentration was less pronounced than that of either betaine or proline. 
Concentrations of betaine or proline which produced broad magnesium optima 
were higher in Taq buffer than in Taq high fidelity buffer. 

In contrast to results obtained with amplification of p53 exon 10, MMNO 
was highly effective at enhancing PCR of the longer 2.8-kb amplicon for Dra 
DNA poll over a broad magnesium concentration range of 1.0 - 2.5 mM 
(Figure 5). This effect was obtained for concentrations of MMNO between 0.4 
and 0.8 M and were similar to those observed for 1 M betaine. Addition of 
proline was also effective at enhancing amplification of the DNA poll fragment; 
however, the effective concentration range for proline was much narrower, and 
its effect on magnesium concentration range was less pronounced than that 
observed for betaine or MMNO. In general, higher concentrations of each 
cosolvent were required to enhance PCR in standard Taq buffer reaction than in 
reactions containing Taq high fidelity buffer. This is consistent with results 
obtained for amplification of p53 exon 10. Dra DNA poll PCR product was not 
observed in reactions which did not contain cosolvent. 
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Example 5: Mixtures of MMNO and Proline Enhance PCR Amplification 

of GC-rich Templates 

Since the results of the foregoing Examples demonstrated that proline was 
highly effective at enhancing reaction optima for amplification of short GC-rich 
templates and MMNO was effective at enhancing amplification of long GC-rich 
templates, mixtures of the two compounds were examined to see if they would 
provide enhanced amplification of GC-rich templates independent of fragment 
size. To test this possibility, GC-rich templates were amplified in the presence of 
compositions comprising proline, MMNO, or both. 

In mixture compositions, 4 M solutions of MMNO and proline were 
combined in ratios of 3:1, 2:1, 1:1, 1:2, and 1:3 respectively, to compose 4M 
hybrid cosolvent mixtures. These mixtures were then assayed for their effect on 
PCR amplification of p53 exon 10 and Dra DNA pol I. PCR reactions were 
performed in 50-ml volumes with Platinum Taq DNA polymerase in 1 X Taq high 
fidelity buffer as described above. Magnesium sulfate concentration was varied 
between 1.0 to 2.5 mM for each concentration of cosolvent tested. 
Concentrations of MMNO, mixtures of MMNO and proline, proline, and betaine 
are as indicated in each figure. 

As shown in Figure 6, mixtures of MMNO and proline were effective at 
enhancing specific amplification of the 156-bp p53 exon 10 fragment over a 
broader magnesium and cosolvent concentration range than that which was 
obtained with either cosolvent alone. Use of MMNO: proline mixtures were also 
highly effective at facilitating amplification of the 2. 8-kb Dra DNA poll fragment 
and significantly extended the effective magnesium and cosolvent concentration 
range over that obtained with proline alone (Figure 7). As was previously 
observed, MMNO enhanced PCR amplification over the full range of magnesium 
and cosolvent concentration tested. Collectively, these results demonstrate that 
the use of compositions comprising mixtures of N-alkyl carboxylic acids and N- 
alkyl amine oxides results in novel properties which can be exploited to enhance 
PCR amplification of GC-rich templates. Specifically, mixtures of MMNO and 
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proline combined in ratios from 2:1 to 1:2 can be used to enhance PCR 
amplification of GC-rich templates over a broad size range and increase the 
reliability of PCR over broader magnesium concentrations. 

Example 6: MMNO:proline Mixtures Enhance PCR of GC-rich 

Templates Over a Broad Annealing Temperature Optimum 

To assess the effects of PCR cosolvents on optimal annealing temperature 
during the PCR reaction, the GC-rich template P32D9 was amplified by PCR 
using the above-described MMNO:proline mixture compositions. PCR reactions 
were performed in 50-ml volumes using thin- walled 0.2-ml tubes (Stratagene, 
Inc.) in 50-ml reactions containing 2.5 U Platinum Tag, IX Tag High Fidelity 
Buffer (60 mM Tris-S0 4 (pH 8.9), 1 8 mM (NH 4 ) 2 S0 4 ), 1.5 mMMgS0 4 , 200 mM 
each dNTP, 200 nM each primer, 100 ng K562 human genomic DNA, and either 
or no added cosolvent, 0.5 M betaine, or 0.5 M 1:1, MMNO:proline. 
Concentrated MMNO: proline mixtures were prepared by mixing equal volumes 
of 4 M MMNO and 4 M proline. Annealing temperature optima were studied 
using a gradient block Robo-cycler (Stratagene) with a heated lid for oil-free 
operation. Following a 1 min denaturation at 95 °C, reactions were cycled 35 
times at 95°C, 45s; 55°-66°C, 45s, 68°C, 1 min. 1 0-ml of each PCR was analyzed 
by agarose gel electrophoresis (1% Agarose 1000, Life Technologies, Inc.) in 
0.5X TBE and ethidium bromide staining. 

As shown in Figure 8, in the absence of PCR cosolvent specific PCR 
product, a 149-bp, 78.5%GC fragment, was obtained only at 66°C. Product yield 
rapidly diminished resulting in amplification of non-specific product as annealing 
temperature was decreased . In contrast, both betaine and MMNO : proline mixture 
extended the effective annealing temperature range. Use of MMNO:proline 
mixture generated higher product yield than that obtained with betaine and 
permitted detection of specific product over the entire 66° to 55°C annealing 
temperature gradient. 



-51- 



Example 7: Use of MMNO:proline Mixture for Amplification of the 

Fragile X CGG Repeat Sequence 

MMNO: proline mixture was compared to betaine for its ability to facilitate 
PCR amplification of very high GC content DNA sequence. Primers were 
designed which bracketed the CGG repeat sequence of the human FMR-1 gene 
in the fragile X locus (GenBank Accession No. X61378). PCR amplifications 
were performed as described above, using 2.5U Platinum Taq DNA polymerase 
high fidelity, 100 ng K562 genomic DNA and IX Taq high fidelity buffer 
supplemented with 2 mM magnesium sulfate (final concentration). Aliquots of a 
4M MMNO:proline mixture prepared as described above, or 4 M betaine, were 
added to PCRs in varying amounts to produce 0.25 M to 2 M final concentration 
of either cosolvent. PCRs were incubated at 95 °C for 1 min., followed by 35 
cycles of: 95°C, 30s; 58°C, 30s; 68°C, 30s. Agarose gel analysis of the results of 
these studies is shown in Figure 9. 

The results of these studies demonstrate the superior ability of 
MMNO:proline mixture, as compared to betaine, to facilitate PCR amplification 
of extremely high (>80%) GC content target sequences. In the absence of PCR 
cosolvent, no specific PCR product was detected. However, a faint band of the 
expected size was visible in reactions containing 1.75 M betaine. In contrast, 
robust amplification of the CGG repeat sequence was demonstrated in reactions 
containing 1.5 to 2 M MMNO:proline. 

Example 8: Use of MMNO: proline Mixtures in Long PCR 

DNA polymerase mixtures composed of Taq DNA polymerase and an 
archaebacterial DNA polymerase possessing proof-reading activity have been used 
for amplification of DNA fragments up to 40-kb (Barnes, W.M. , Proc. Natl. Acad. 
Sci. USA 91:2216-2220 (1994)). The ability of MMNO: proline mixture to 
facilitate amplification of long GC-rich sequences was tested using primers 
designed to amplify 7.77-kb or 9.75-kb fragments of adenovirus type 2 DNA 
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(-60% GC). PCRs were performed using 1 pg of adenovirus type 2 DNA (Life 
Technologies, Inc.), IX Fa? high fidelity buffer supplemented 1.5 mM magnesium 
sulfate, and varying amounts (0 to 1M) of MMNO: proline mixture essentially as 
described above except that 2. 5U Platinum Taq DNA polymerase high fidelity, an 
enzyme blend of DNA polymerase from Thermus aquaticus and Pyrococcus 
species GB-D, was substituted for Platinum Taq DNA polymerase. Reactions 
were incubated for 1 min at 95°C, followed by 35 cycles of: 95°C, 30s, 58°C, 30s, 
68°C, 10 min. 

As shown in Figure 10, successful amplification of the expected 7.77-kb 
and 9.75-kb DNA fragments was dependent on inclusion of MMNO:proline 
cosolvent. Specific product was not detected in reactions without 
MMNO .proline; however, robust amplification and high product yield was 
obtained by inclusion of 0.25 M MMNO:proline mixture. Product yield in long 
PCR was sensitive to the amount of MMNO : proline used as product yield for both 
the 7. 3 and 9. 7-kb target decreased with increasing MMNO :proline concentration. 

Example 9: Comparison of Compensatory Solutes to Enhance PCR 

Amplification of GC Rich Templates 

A wide variety of amino compounds have been shown to serve a critical 
function in protecting organisms from osmotic stress. Betaine and proline are the 
major osmoprotectants in£. coli. Since both these compounds disrupt DNA helix 
stability and thereby facilitate amplification of GC rich templates, the effects of 
other known and commercially available osmoregulatory compounds in PCR were 
investigated. 

PCR mixtures were prepared in a volume of 50 ml, containing 2.5 U 
Platinum Taq DNA polymerase, 60 mM Tris-S0 4 (pH 8.9), 18 mM 
(NH4) 2 S0 4 ,1.5 mM MgS0 4 , 200 mM dNTP (each), 200 nM primer (each), 100 
ng K562 human genomic DNA and varying amounts of PCR cosolvents were 
prepared. Reactions were incubated at 95°C for 1 min, followed by 35 cycles of: 
95°C, 30s; 58°C, 30s; 68°C, 1 min. 
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Comparison of the efficacy of MMNO:proline mixture, betaine, 
L-carnitine; and DL-pipecolic acid are shown in Figure 11, while the results of a 
separate experiment in which ectoine was compared to betaine are shown in 
Figure 12. Experimental conditions were essentially the same as those described 
above except that the final reaction volume was 25 ml. 

All of the osmolytes examined in these experiments were effective at 
enabling amplification of the P32D9 sequence and demonstrate that a wide variety 
of N-alkyl carboxylic acid derivatives can be used to facilitate PCR amplification 
of difficult templates, such as templates that are high in GC content. 

Having now fully described the present invention in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be obvious 
to one of ordinary skill in the art that the same can be performed by modifying or 
changing the invention within a wide and equivalent range of conditions, 
formulations and other parameters without affecting the scope of the invention or 
any specific embodiment thereof, and that such modifications or changes are 
intended to be encompassed within the scope of the appended claims. 

All publications, patents and patent applications mentioned in this 
specification are indicative of the level of skill of those skilled in the art to which 
this invention pertains, and are herein incorporated by reference to the same extent 
' as if each individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference. 



